Picture for Guangxiang Zhao

Guangxiang Zhao

A Primer in Post-Training Reasoning Data: What We Know About How It Works

Add code
Jun 01, 2026
Viaarxiv icon

Harness-Bench: Measuring Harness Effects across Models in Realistic Agent Workflows

Add code
May 27, 2026
Viaarxiv icon

Thinking with Reasoning Skills: Fewer Tokens, More Accuracy

Add code
Apr 23, 2026
Viaarxiv icon

Beyond Parameter Arithmetic: Sparse Complementary Fusion for Distribution-Aware Model Merging

Add code
Feb 12, 2026
Viaarxiv icon

Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design

Add code
Jun 05, 2025
Viaarxiv icon

TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation

Add code
Mar 06, 2025
Figure 1 for TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation
Figure 2 for TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation
Figure 3 for TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation
Figure 4 for TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation
Viaarxiv icon

Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision

Add code
Feb 28, 2025
Figure 1 for Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision
Figure 2 for Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision
Figure 3 for Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision
Figure 4 for Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision
Viaarxiv icon

LongAttn: Selecting Long-context Training Data via Token-level Attention

Add code
Feb 24, 2025
Figure 1 for LongAttn: Selecting Long-context Training Data via Token-level Attention
Figure 2 for LongAttn: Selecting Long-context Training Data via Token-level Attention
Figure 3 for LongAttn: Selecting Long-context Training Data via Token-level Attention
Figure 4 for LongAttn: Selecting Long-context Training Data via Token-level Attention
Viaarxiv icon

Stress Testing Generalization: How Minor Modifications Undermine Large Language Model Performance

Add code
Feb 18, 2025
Viaarxiv icon

When to Trust Aggregated Gradients: Addressing Negative Client Sampling in Federated Learning

Add code
Jan 25, 2023
Figure 1 for When to Trust Aggregated Gradients: Addressing Negative Client Sampling in Federated Learning
Figure 2 for When to Trust Aggregated Gradients: Addressing Negative Client Sampling in Federated Learning
Figure 3 for When to Trust Aggregated Gradients: Addressing Negative Client Sampling in Federated Learning
Figure 4 for When to Trust Aggregated Gradients: Addressing Negative Client Sampling in Federated Learning
Viaarxiv icon